30 research outputs found
Continuity of singular perturbations in the graph topology
AbstractFor a certain model for singular perturbations in control systems, which we motivate by a simple example, we show that under weak assumptions continuity in the graph topology holds as the perturbation parameter tends to zero. This may be contrasted with a result by Cobb, who considered a different model for singular perturbations and who found a strong condition to be necessary for continuity in that model. Our proof techniques are based on the characterization (due to Qiu and Davison) of the graph topology as a topology of uniform convergence
Handwritten Text Recognition for Historical Documents in the tranScriptorium Project
""漏 Owner/Author 2014. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ACM, In Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (pp. 111-117) http://dx.doi.org/10.1145/2595188.2595193Transcription of historical handwritten documents is a crucial
problem for making easier the access to these documents
to the general public. Currently, huge amount of historical
handwritten documents are being made available by on-line
portals worldwide. It is not realistic to obtain the transcription
of these documents manually, and therefore automatic
techniques has to be used. tranScriptorium is
a project that aims at researching on modern Handwritten
Text Recognition (HTR) technology for transcribing historical
handwritten documents. The HTR technology used in
tranScriptorium is based on models that are learnt automatically
from examples. This HTR technology has been
used on a Dutch collection from 15th century selected for
the tranScriptorium project. This paper provides preliminary
HTR results on this Dutch collection that are very
encouraging, taken into account that minimal resources have
been deployed to develop the transcription system.The research leading to these results has received funding from the European Union鈥檚 Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 600707 - tranScriptorium and the Spanish MEC under the STraDa (TIN2012-37475-C02-01) research project.S谩nchez Peir贸, JA.; Bosch Campos, V.; Romero G贸mez, V.; Depuydt, K.; De Does, J. (2014). Handwritten Text Recognition for Historical Documents in the tranScriptorium Project. ACM. https://doi.org/10.1145/2595188.2595193
The Database of the Southern Dutch Dialects: new opportunities for digital lexicological research
The Southern Dutch dialect area consists of four dialect groups; the Flemish, Brabantic,
Limburgian and the Zeeland dialects, partly found in the Netherlands, in Belgium and in NordPas-de-Calais (France). Each of these dialects has been described in a separate dictionary. The
Flemish, Brabantic and Limburgian dialect dictionaries are similarly constructed
onomasiological dialect dictionaries, whereas the Zeeland dialect dictionary is semasiologically
arranged.
The Database of the Southern Dutch Dialects (DSDD) project, initiated by Ghent University and
undertaken in close collaboration with the Dutch Language Institute, aimed to combine the
onomasiological dictionaries into one platform accessible for both dialect researchers and a
wider audience. Following the launch of the DSDD platform in 2020, dialect researchers can
now make use of extensive search functionalities and geo-visualise their results on an
interactive map.
The DSDD enables searching for individual dialect words (e.g. puit 'frog'), for all dialect words
from a certain municipality or region (e.g. all dialect words from the city of Antwerp) or for all
dialect words belonging to a certain concept (e.g. 'frog') or theme (fauna). Filters are available
to make the search query as accurate as possible.
Based on the search results, interactive maps can be compiled, visualising the data in its
geographical setting. The results can be clustered in various ways, using, for instance, frequency
or etymologically related entries. The maps are flexible enabling the user, for example, to select
the required level of detail to be visualised for the chosen region. A set of symbols and colours
is available and can be arranged according to the user鈥檚 needs. The area map can be enlarged
or reduced in size and printed or downloaded as a PDF file. In the future, the alphabetical dialect
data from the Woordenboek der Zeeuwse Dialecten will be added to the platform.
References
Van Keymeulen, J., V. De Tier, R. Vandenberghe & S. Chambers (2019), The dictionary of the
Southern Dutch Dialects (DSDD): designing a virtual research environment for digital
lexicological research. in: Dialectologia. Special issue, 8 (2019), 93-115
Sustaining the Dictionary of the Southern Dutch Dialects (DSDD) : a case study for CLARIN and DARIAH
Modelling frequency and attestations for OntoLex-Lemon
The OntoLex vocabulary enjoys increasing popularity as a means of publishing lexical resources with RDF and as Linked Data. The
recent publication of a new OntoLex module for lexicography, lexicog, reflects its increasing importance for digital lexicography.
However, not all aspects of digital lexicography have been covered to the same extent. In particular, supplementary information drawn
from corpora such as frequency information, links to attestations, and collocation data were considered to be beyond the scope of
lexicog. Therefore, the OntoLex community has put forward the proposal for a novel module for frequency, attestation and corpus
information (FrAC), that not only covers the requirements of digital lexicography, but also accommodates essential data structures for
lexical information in natural language processing. This paper introduces the current state of the OntoLex-FrAC vocabulary, describes
its structure, some selected use cases, elementary concepts and fundamental definitions, with a focus on frequency and attestations
tranScriptorium: a european project on handwritten text recognition
The tranScriptorium project aims to develop innovative,
efficient and cost-effective solutions for annotating handwritten
historical documents using modern, holistic Handwritten
Text Recognition (HTR) technology. Three actions are
planned in tranScriptorium: i) improve basic image preprocessing
and holistic HTR techniques; ii) develop novel indexing
and keyword searching approaches; and iii) capitalize
on new, user-friendly interactive-predictive HTR approaches
for computer-assisted operation.The research leading to these results has received funding from the European Union鈥檚 Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 600707 - tranScriptorium.S谩nchez Peir贸, JA.; M眉hlberger, G.; Gatos, B.; Schofield, P.; Depuydt, K.; Davis, RM.; Vidal, E.... (2013). tranScriptorium: a european project on handwritten text recognition. ACM. https://doi.org/10.1145/2494266.2494294